Autoregressive HMMs for speech synthesis
Authors
Abstract
We propose the autoregressive HMM for speech synthesis. We show that the autoregressive HMM supports efficient EM parameter estimation and that we can use established effective synthesis techniques such as synthesis considering global variance with minimal modification. The autoregressive HMM uses the same model for parameter estimation and synthesis in a consistent way, in contrast to the standard HMM synthesis framework, and supports easy and efficient parameter estimation, in contrast to the trajectory HMM. We find that the autoregressive HMM gives performance comparable to the standard HMM synthesis framework on a Blizzard Challenge-style naturalness evaluation.
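As a rough illustration of what "the same model for parameter estimation and synthesis" can mean, the sketch below assumes a simplified scalar autoregressive output distribution: given the state, each frame is Gaussian with a mean that is a bias plus a weighted sum of the previous frames. This form, the zero padding before the first frame, and all names (`ar_mean`, `log_likelihood`, `generate_mean_trajectory`, `params`) are assumptions made for illustration, not the authors' formulation. Under these assumptions, the same per-state parameters score a sequence and generate one.

```python
import math

def ar_mean(history, coeffs, bias):
    """Mean of the current frame: bias plus weighted sum of previous frames."""
    # history[-1] is the most recent frame; coeffs[0] weights it.
    return bias + sum(a * x for a, x in zip(coeffs, reversed(history)))

def log_likelihood(obs, states, params, order):
    """Log-likelihood of a scalar sequence given a fixed state sequence.

    params maps state -> (coeffs, bias, variance). Frames before the start
    are assumed to be zero (an illustrative choice). Requires order >= 1.
    """
    padded = [0.0] * order + list(obs)
    total = 0.0
    for t, (x, s) in enumerate(zip(obs, states)):
        coeffs, bias, var = params[s]
        mu = ar_mean(padded[t:t + order], coeffs, bias)
        total += -0.5 * (math.log(2 * math.pi * var) + (x - mu) ** 2 / var)
    return total

def generate_mean_trajectory(states, params, order):
    """Run the autoregressive recursion forward with no noise.

    Every residual is zero along this trajectory, so it maximises the
    likelihood above for the given state sequence.
    """
    traj = [0.0] * order
    for s in states:
        coeffs, bias, _var = params[s]
        traj.append(ar_mean(traj[-order:], coeffs, bias))
    return traj[order:]

# Hypothetical two-state model with first-order dependence.
params = {0: ([0.5], 1.0, 0.1), 1: ([0.5], -1.0, 0.1)}
states = [0, 0, 1]
trajectory = generate_mean_trajectory(states, params, order=1)
score = log_likelihood(trajectory, states, params, order=1)
```

In this toy setting, synthesis reuses exactly the parameters that the likelihood is defined over, which is the kind of consistency the abstract contrasts with the standard HMM synthesis framework.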
Similar resources
A practical perceptual frequency autoregressive HMM enhancement system
We have previously developed a speech enhancement scheme which can adapt to unknown additive noise. We model speech and noise using perceptual frequency or ‘warped’ autoregressive HMMs (AR-HMMs) and estimate the clean speech and noise parameters within this framework. In this current work, we investigate the use of our system as a front end to an MFCC recognition system trained on clean speech. ...
Using AR HMM state-dependent filtering for speech enhancement
In this paper we address the problem of enhancing speech which has been degraded by additive noise. As proposed by Ephraim et al., autoregressive hidden Markov models (AR-HMM) for the clean speech and an autoregressive Gaussian for the noise are used. The filter applied to a given frame of noisy speech is estimated using the noise model and the autoregressive Gaussian having the highest a poste...
Nonlinear mixture autoregressive hidden Markov models for speech recognition
Gaussian mixture models are a very successful method for modeling the output distribution of a state in a hidden Markov model (HMM). However, this approach is limited by the assumption that the dynamics of speech features are linear and can be modeled with static features and their derivatives. In this paper, a nonlinear mixture autoregressive model is used to model state output distributions (...
Enhancement and recognition of noisy speech within an autoregressive hidden Markov model framework using noise estimates from the noisy signal
This paper describes a new algorithm to enhance and recognise noisy speech when only the noisy signal is available. The system uses autoregressive hidden Markov models (HMMs) to model the clean speech and noise and combines these to form a model for the noisy speech. The probability framework developed is then used to reestimate the noise models from the corrupted speech waveform and the proces...
Connectionist Approaches to the Use of Markov Models for Speech Recognition
Previous work has shown the ability of Multilayer Perceptrons (MLPs) to estimate emission probabilities for Hidden Markov Models (HMMs). The advantages of a speech recognition system incorporating both MLPs and HMMs are the best discrimination and the ability to incorporate multiple sources of evidence (features, temporal context) without restrictive assumptions of distributions or statistical ...